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I. Motivation 

Managing cloud services is a fundamental challenge in 
todays virtualized environments. These challenges equally face 
both providers and consumers of cloud services. The issue 
becomes even more challenging in virtualized environments 
that support mobile clouds. Cloud computing platforms such 
as Amazon EC2 provide customers with flexible, on demand 
resources at low cost. However, they fail to provide seam- 
less infrastructure management and monitoring capabilities 
that many customers may need. For instance, Amazon EC2 
doesn't fully support cloud services automated discovery and 
it requires a private set of authentication credentials. Sales- 
force. com, on the other hand, do not provide monitoring 
access to their underlying systems. Moreover, these systems 
fail to provide infrastructure monitoring of heterogenous and 
legacy systems that don't support agents. In this work, we 
explore how to build a cloud management system that com- 
bines heterogeneous management of virtual resources with 
comprehensive management of physical devices. We propose 
an initial prototype for automated cloud management and 
monitoring framework. Our ultimate goal is to develop a 
framework that have the capability of automatically tracking 
configuration and relationships while providing full event 
management, measuring performance and testing thresholds, 
and measuring availability consistently. Armed with such a 
framework, operators can make better decisions quickly and 
more efficiently. 

II. Challenges 

These tasks are achieved through an agentless monitoring 
of the cloud's infrastructure. While traditional network man- 
agement methods suffer from inherited difficulties (TJ, 0, 
implementing seamless network management and monitoring 
framework entails several new challenges: 

• Discovering the relationship of virtualized resources to 
underlying physical infrastructure. 

• Minimizing the overhead of monitoring and problem 
determination across a physical and virtualized infrastruc- 
ture. 

• Handling security-related constraints that may affect data 
collection is probably one of the most serious issues. 

> Response action should be taken regarding a particular 
virtual or physical device within the hard response dead- 
line time frame. In agentless-based monitoring systems, 


this can be insured only by implementing high number 
of threads, which in turn increases complexity. 
• Dealing with infrastructure management issues such as 
root-cause analysis becomes more complex. 

III. Design and Implementation 

We propose an event-based model where events are placed 
on an in-memory publish/subscribe bus on the Management 
Server, enabling a high throughput of events. 

The event bus architecture, depicted in Figure Q] enables 
any authorized mediator to create events on the bus, and any 
authorized consumer to access events from the bus. Events on 
the bus show current status of infrastructure components. 
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Fig. 1 . An initial prototype of our cloud management and monitoring system. 

The framework will provide a set of APIs to simplify cre- 
ation of consumer and mediator applications. A set of language 
extensions and Web services will be used to enable Perl, Ruby, 
or Java scripts to create events on the bus. To support high 
level of reliability and scalability, the Distributed Collector 
subsystem will be multi-threaded. Furthermore, events will are 
normalized from any source into a common format, which will 
enable consistent processing. 
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